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ABSTRACT 

We apply the technique of self-organising maps l|Kohonenl 1 19901) to the automated 
classification of singly periodic astronomical lightcurves. We find that our maps read- 
ily distinguish between lightcurve types in both synthetic and real datasets, and that 
the resulting maps do not depend sensitively on the chosen learning parameters. Au- 
tomated data analysis techniques are likely to be become increasingly important as 
the size of astronomical datasets continues to increase, particularly with the advent 
of ultra- wide- field survey telescopes such as WASP, RAPTOR and AS AS. 
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1 INTRODUCTION 

Ultra-wide-field optical surveys are now capable of rou- 
tinely acquiring high-precision time-resolved photometry of 
large samples of celestial objects. S uch projects include th e 
All Sky Automated Survey, AS AS l|Poimanskilll997 '. 2002); 
the Robotic Optical Transient Search Ex periment, ROTSE 
jAkerlof et aLlbood: IWozniak et al.ll2004l) . t he Rapid Tele- 
scope s for Optical Response, RAPTOR dVestrand et alJ 
|2002), S tellar Astrophysics and Research on Exoplanets, 
STARE (lAlonso et alJl2003h and the Wide Angle Sear ch for 
Planets, WASP jKane et al.ll20oi IStreet et alJl2003t) . The 
largest of these projects are already capable of monitoring 
around 10* objects. To be of practical use it is important 
that the lightcurves acquired by these projects can be clas- 
sified automatically, so that a human user is able to filter 
the available data to discriminate lightcurves of interest. 

Previous attempts at automated classification have fo- 
cused on singly-periodic variable stars and have tended 
to rely on an analysis of parameters derived from the 
Ughtcurves, such as period and Fourier co-efficients. Clas- 
sification has been achieved by identifying specific regions 
in the multi-dimensional space defined by these param- 
eters that correspond to different classes of object. Ex- 
am ples of this approach include analysis of ROTSE data 
bv lAkerlof et alJ (l2000l) and analy s is of ASAS data by 



lEver fc Blakd J2002ll and lPoimans'kil EoO: 

In this paper we present an algorithm designed to clas- 
sify singly-periodic variable stars based directly on their un- 
parameterised folded lightcurves. Our algorithm uses an ar- 
tificial neural network, and is capable of forming clusters 
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of lightcurves with similar shapes within an arbitrary N- 
dimensional space. In Sect.|21we describe our algorithm in 
general terms, and in Sect.|21we describe and discuss its ap- 
plication to real and synthetic test data sets. Section|3 in- 
cludes a discussion of the choices made in a specific imple- 
mentation of the algorithm. Finally, in Sect. 2] we describe 
and test methods designed to identify clusters of lightcurves 
in our output maps. 



2 THE ALGORITHM 

The algorithm we have developed utilises a form of neu- 
ral network, na mely the self-organising map scheme due to 
iKohonenI lll990l) . We are motivated in this choice by two de- 
sirable properties of Kohonen maps; simplicity of implemen- 
tation and their ability to learn in an unsupervised regime. 
Unsupervised learning allows a network to be trained with- 
out the requirement for pre-classification of the training set, 
and free from any biases which might be introduced by a 
human trainer. In contrast to most supervised schemes, an 
unsupervised network is in principle capable of identifying 
and grouping previously unknown or unanticipated object 
classes within the training set. 

Previous applications of Kohonen self-organising maps 
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have also been used to classify the astronomical literature 
JPoincot et al.lll998^ . 

The self-organising map consists of a set of M neurons 
(or nodes). Associated with each neuron j is a reference vec- 
tor, Wj. When presented with a stimulus in the form of an 
input vector, Xi, each neuron "fires" with a strength that 
is related to the similarity between the input and the refer- 
ence vector. The reference vector of the most strongly firing 
neuron in the map therefore represents the closest match to 
the input vector. 

The neurons are arranged in an A'^- dimensional space, 
and we have chosen to arrange them in a rectangular lattice. 
Each dimension is cyclic, so for example in the case of a 2- 
dimensional map we could consider it to have the topology 
of a torus. Typically A'' is chosen to be 1, 2 or 3. Each neuron 
j has a Cartesian coordinate rj in this A'- dimensional space 
(hereafter referred to as map space). 

By presenting each member i of some input set to each 
neuron j in turn, we can ascertain which neuron most closely 
matches the input member and associate the coordinate of 
the neuron with the input member. In this way we can assign 
a location in map space to every member of the input set. 

In our algorithm the reference vectors Wj of the neurons 
represent template lightcurves, and the input vectors Xi the 
measured lightcurves of the astronomical objects we wish to 
classify. 

The reference vectors of the neurons are initially as- 
signed random values. The goal of training the network is to 
adjust the neuron reference vectors in such a manner that 
members of the input set that are "similar" are placed more 
closely together in map space than members that are dis- 
similar. 

The network is trained in an iterative fashion by ex- 
posing it to a training set. Each member of the training set 
is assigned a coordinate in map space corresponding to the 
neuron in the map that most closely matches the member. 
The reference vectors in the best-matching neuron, and its 
close neighbours, are updated by blending them with the in- 
put data. The algorithm continues until all members of the 
training set have been classified and have updated the net- 
work reference vectors. In the second and subsequent itera- 
tions the network is repeatedly exposed to the same training 
set, and the algorithm proceeds in this manner until some 
completion criterion is met. 

The blending process is a key element of the algorithm, 
as it is this that allows the network to "learn" , by adapting 
the contents of the neuron reference vectors to match the 
incoming data. The learning process is competitive; features 
in the reference vectors which lead to good matches with 
input set members tend to be reinforced. This reinforcement 
effect is stronger if the feature is shared by many members 
of the training set. As the network learns the coordinates 
of the members of the training set in map space evolve, and 
training set members which resemble each other tend to self- 
organise into clusters within the map space. 

2.1 Data preparation 

Before training of the network commences the training set 
of lightcurves are epoch folded on a previously identified 
period, and binned to match the cardinality of the the ref- 
erence vectors in the network neurons. The binned folded 



lightcurves are then normalised to the brightness range to 
1 magnitudes. These folding and normalisation steps yield a 
training set that is scale-free, and not complicated by issues 
of absolute mean brightness, nor the time-scale or amplitude 
of the variability present in the astronomical lightcurves. By 
taking these steps we ensure that the neural network is clas- 
sifying on shape alone. 

Where a population might be expected to contain mem- 
bers with lightcurves of similar shape but differing ampli- 
tude, one might choose to normalise on mean brightness 
alone, in order to allow the network to distinguish between 
these classes. 

2.2 Identifying the closest matching neuron 

During the training phase the algorithm must identify which 
of the template lightcurves (represented by the reference vec- 
tors Wj in the network neurons) is the closest match to the 
lightcurve of the training set member i currently under con- 
sideration. 

We achieve this by computing for every neuron j the 
value of a statistic, Sij^min, such that: 




where is the cardinality of the input and reference 
vectors, q and p allow for all possible phasings between the 
input and reference vectors, k = (q+p) mod n„, and Wjk is 
the fcth element of the reference vector of neuron j. 

The minimisation over p is necessary in our algorithm 
as we make no attempt in the data preparation stage to 
establish an absolute reference phase for our epoch-folded 
lightcurves. We feel that any attempt to do this might prove 
unreliable when working with noisy data. Rather we choose 
to compare the input lightcurve with the template at all 
possible phase offsets, and choose the minimum value of the 
statistic Sij. 

The best matching neuron c is that for which Sij.min is 
minimised over all of the M neurons in the map. 

2.3 Modifying the network 

Once the best-matching neuron c has been identified its ref- 
erence vector and that of its near neighbours are modified 
as follows: 

w'ij = Wij + a{t)h{rc ~ ri,t){x^ ~ uij-,) 

where a is the learning rate coefficient, h{r,t) is the 
neighbourhood kernel, and t is a time coordinate. We adopt 
a Gaussian form for the neighbourhood kernel 

h{r,t) = exp(-rV2cr^(t)) 

The time coordinate t varies linearly with iteration 
number, from t = during the first learning iteration, to 
t = 1 during the last iteration. 

2.4 Evolving a and cr 

The instantaneous rate at which the network learns is in- 
fiuenced by both the learning rate coefficient a, which con- 
trols the degree to which reference vectors are blended with 
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the input data, and a which determines the effective size of 
the neighbourhood. In a typical application of the Kohonen 
scheme both a and cr are chosen to be monotonically de- 
creasing functions of time, t. The combined effect of such 
a choice is that the large-scale structure in the map tends 
to form early in the training phase, with the finer details 
crystallising in the later stages of learning. 

In common with Kohonen we have found that the abil- 
ity of the network to learn successfully is not very sensi- 
tive to the precise functions chosen to evolve a and o. The 
key requisites are that a should not drop so quickly as to 
stop the learning process prematurely, and that ao should be 
chosen such that the neighbourhood kernel encompasses a 
sufficient fraction of the map to allow only large-scale struc- 
ture to form early in the learning phase. Subsequently the 
spatial scale of the learning must be reduced to allow the 
network to develop finer structure and settle into a stable 
state. We choose to reduce the width of the neighbourhood 
kernel linearly with time. 

We will show that a can evolve linearly or exponentially 
with time t, or even remain constant throughout the learning 
phase, with little impact on the ability of the map to self- 
organise in a robust manner. 



3 PERFORMANCE OF THE ALGORITHM 

We have tested our algorithm using two training sets. Firstly 
we have generated a sample of 5000 synthetic lightcurves 
with a distribution of shapes drawn from four distinct 
classes. The second training s et comp rised 1206 lightcurves 
from the ROTSE experiment (lAkerlof o t al. 2QQfl,). 

Our timing tests show that, for a given size of training 
set, the algorithm execution time for each iteration scales 
linearly with the number of neurons, M, and the cardinality 
of the neuron reference vectors. 

3.1 Configuration of tiie network 

The tests presented here were obtained using a two- 
dimensional network (A*' = 2) . The two-dimensional network 
is attractive as it provides sufficient freedom of expression to 
allow clusters to form, whilst ensuring that the results are 
straightforward to visualise and interpret (as 2-D images, 
for example). 

In the traditional Kohonen scheme, as adopted here, 
the number of neurons comprising the network is constant 
through the learning process and must be determined be- 
fore training commences. We have performed tests over a 
range of network sizes, from 5 x 5 to 60 x 60, and found that 
the choice does not strongly inffuence the network's ability 
to successfully form clusters. The key considerations are to 
ensure that sufficient neurons are available to allow the net- 
work to resolve the subtle differences in the features of the 
lightcurves, whilst avoiding a situation where the number 
of neurons is comparable to or exceeds the number of mem- 
bers in the training set. In post-processing cluster extraction 
stages (Section 2J it is beneficial to have enough nodes to 
resolve cluster boundaries but not so many that the average 
number of training set members per neuron drops so low 
as to make the effect of learning on the map negligible. We 
have found that a network size of 20 x 20 nodes is suitable 



for training sets numbering a few thousand members and is 
adequate for real-life datasets such as the ROTSE periodic 
variables. 

The cardinality of the neuron reference vectors, 
must also be chosen before training commences, and indeed 
before the training lightcurves are epoch folded and binned. 
We have found that the performance of the network is not 
strongly dependent on the choice of n„, subject to it be- 
ing large enough to allow the distinguishing features of the 
lightcurves to be resolved. We tried values of 32 and 64 el- 
ements for the reference vectors and found no difference in 
performance for the smoothly varying lightcurves in our test 
dataset. 

The final choices are the ranges and functional forms 
of the a and a learning parameters, and the number of it- 
erations carried out over these ranges. In our algorithm the 
number of iterations is not set by testing for a convergence 
criterion, but rather is fixed at the beginning of the run 
and acts to set the resolution of the evolution of a and a. 
In all the runs presented in this paper o is initially set to 
half the size of the network and is then reduced linearly to 
one quarter the separation of the neurons (i.e. the one-sigma 
full- width of the smoothing Gaussian is reduced from the full 
size of the network to half the spacing between neurons) . We 
made a number of different choices for a, described below. 

3.2 Tests with simulated lightcurves 

3.2.1 Synthetic dataset 

We simulate lightcurves with the following basic shapes (see 
Figure : 

(i) a single skewed Gaussian peak (representing stars of 
the RR Lyrae ab-type family); 

(ii) a single symmetric Gaussian peak (representing stars 
of the RR Lyrae c-type, 5 Scuti and Cepheid families); 

(iii) constant brightness with two broad Gaussian dips of 
equal width but differing depth (representing eclipsing close 
binary stars); 

(iv) as |(iii)| but with narrower dips (representing well- 
separated eclipsing binaries). 

These functions are not intended to represent accurately 
the real lightcurves of such systems, but merely to produce 
a test data set with the appropriate range of behaviour. 

For each synthetic lightcurve the characteristic features 
of its class (the width of peaks and dips, and the relative 
depths of the dips) were chosen randomly from within ranges 
chosen to yield profil es representative of those observed by 
ROT SE and ASAS jAkerlof et al.ll2000l: IPoTmanskil Il997l. 
l2002h . 

Noise is added to the synthetic lightcurves to represent 
photometric measurement errors. The distribution of signal- 
to-noise ratio in the set of synthetic lightcurves is chosen 
to represent a source population with a power-law cumu- 
lative distribution in flux with a slope of —1 (a uniformly 
distributed source population would have a steeper slope of 
—3/2). In terms of magnitudes our synthetic data have the 
following mean brightness distribution. 
2 

\ogN{< m„) cx -m„ 
o 

Our actual magnitude distribution is presented in Fig.[5| 




Figure 3. Evolution of a network trained with our synthetic data and with a{t) = = 0.01. Panels show (from left to right, top 
to bottom), iterations 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34. The four classes of synthetic lightcurve (described in Section i3.2.1l 
are denoted by (i) squares, (ii) crosses, (iii) diamonds and (iv) stars. Note that, for the purposes of visualisation, the positions of all 
lightcurves have been randomised within their host neurons. 
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Figure 1. Samples of the four classes of simulated lightcurves, 
representing type-c RR Lyrae, S Scuti and Cepheid types (top 
left), type-ab RR Lyrae (top right), short-period (bottom left) 
and long-period eclipsing binary (bottom right) 



As the network is evolved, and the kernel size is decreased, 
the network becomes increasingly free to develop substruc- 
ture which better represents the the range of shapes of 
the synthetic lightcurves. Between iterations 13 and 16 one 
of the initial clusters splits into two (the contact and de- 
tached eclipsing binaries), and between iterations 13 and 
22 the other initial cluster also gradually splits (the sym- 
metric and skewed Gaussian shapes). By iteration 22 the 
map well represents the four types of lightcurve. As the ker- 
nel size continues to decrease the clusters repel each other 
less strongly and begin to spread to occupy more of the 
available neurons. This is because sharper differences in the 
template lightcurves are now allowed between neighbouring 
neurons. The increasing size of the clusters allow them to 
better describe the range of sub-structure within each type 
of lightcurve, and smooth variations of lightcurve proper- 
ties are seen across the individual clusters. Finally, as the 
smoothing kernel decreases below the separation of individ- 
ual neurons, neighbouring neurons become decoupled and 
the clusters can make contact. At this point the network 
has evolved to the point that all of its neurons are repre- 
senting subtly different lightcurve shapes, and learning is 
complete. 



Figure 2. The lightcurve brightness distribution for the simu- 
lated dataset. 



3.2.2 Learning behaviour of the network 

Figure|21 shows the evolution of a typical network. In this 
example the a parameter is held constant at 0.01 through- 
out. The full run had a resolution of 34 iterations. Figure^ 
shows every third iteration and, for purposes of visualisa- 
tion, the positions of all lightcurves have been randomised 
within their host neurons. 

It can be seen in Fig.j^that the 5000 lightcurves ini- 
tially form two tight clusters. This is because the synthetic 
lightcurves naturally divide into two basic types (single and 
double peaked) and because the large smoothing kernel does 
not allow the network to form sufficient structure to resolve 
the lightcurve sub-types. The lightcurves are tightly clus- 
tered because the large smoothing length forces the clus- 
ters to strongly repel one another. Weak cross-shaped struc- 
tures are also apparent, and are a natural consequence of 
the wrapping of the Gaussian kernel in both dimensions. 



3.2.3 Monitoring learning performance 

The learning behaviour of our maps motivates us to track 
three measures which allow us to quantify the degree of re- 
organisation which is taking place at a given time, and the 
quality of the match between the template lightcurves and 
the training set members assigned to them. 

The first of these measures (S) is the value of Sic,min 
averaged over all members i of the training set. The statistic 
Sic,min can be regarded as a measure of the goodness-of-fit 
between a training set member and the template lightcurve 
of the network neuron c which most closely matches it. By 
averaging over all members of the training set we can there- 
fore measure how well the template lightcurves of the net- 
work as a whole have adjusted to represent the input data. 
We expect S to remain high during early iterations, to show 
a rapid fall once the network becomes free to represent all 
the lightcurve sub-types as separate groups, and then to ex- 
hibit a slower decline as finer adjustments are made to the 
template lightcurves. 

The second measure (nmove) is a count of the number 
of training set members that have changed location in map 
space from one iteration to the next. We expect maximum 
movement at times of maximum learning. 

The final measure (D) is the average Euclidean distance 
moved by a training set member in map space from one 
iteration to the next. Again, we expect maximum movement 
at times of maximum learning. 

Figures |1| |H| and |S| show the evolution of S, Umove and 
D as networks are trained using the synthetic lightcurves 
described above. Several training runs were made, differing 
only in the initial value chosen for a, and the functional form 
with which a evolves. We choose three functional forms for 
alpha: a constant value a{t) = ao, a linear form a{t) = 
Qo(l — t), and an exponential form a{t) — aoexp{ — Xt), 
where the scaling constant A is chosen such that a — 10~^ 
in the final iteration. 

Taken together these three figures illustrate that the 
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Figure 4. The average "goodness of fit" , S. In the leftmost col- 
umn the learning parameter a is held constant throughout the 
learning phase. In the central column a falls linearly, and in the 
rightmost column exponentially. The initial value of a assumes 
the value 0.9 (top row), 0.5 (second row), 0.1 (third row) and 
0.01 (bottom row). 

ability of the map to robustly self-organise is not critically 
dependent on the choice of either qq or the functional form of 
a{t). In particular Fig.|l|shows that the template lightcurves 
adjust to faithfully represent the input data in all cases. All 
panels in Fig. 3] show a monotonic improvement in the fit 
statistic that begins relatively slowly while cr is high, be- 
comes more rapid when a reaches a critical value that allows 
all types of lightcurve to be represented by their own group- 
ing in the network, and finally slows as the maps approach 
a steady state at the end of the learning process. The final 
value of S is similar in all cases except in Fig. • This run 
refers to an exponential decrease in a from 0.01 to 0.001. It 
is clear from this panel that a can drop too fast too soon, 
thereby "freezing in" structure in the network and prema- 
turely ending learning. For this reason we favour a constant 
a{t) that allows equal learning at all stages of the learning 
process. We also do not use a values substantially below 
0.01 (although lower values may be acceptable as long as 
the map is given sufficient time to evolve into equilibrium). 

Figures |5] and |S] reveal more details of the learning pro- 
cess. Once again, most panels show broadly the same be- 
haviour, with most learning occurring during the middle 
of the run, and less at the beginning and ends. It is clear, 
however, that movement of lightcurves within the map is a 
function of a. For ao ^ 0.2 the evolution of the map appears 



Figure 5. The number of training set members that move from 
one iteration to the next, Umove- Panel order is the same as Fig- 
ure |1] 

to be stable, with individual clusters moving and evolving 
slowly and consistently with iteration. For larger values of 
ao, however, we have noticed that although large-scale struc- 
ture does indeed form, that the cluster pattern as a whole 
continues to migrate rapidly through map space (Figure 
03,131 In a real- world application this mobility and 

constant restructuring would quite likely prove to be unde- 
sirable. For this reason we prefer to choose values of ao ^ 0.2. 

The rapid learning apparent in the middle iterations 
in Figs. |1]|3&|S| reveals the presence of a critical value of a 
which allows the network sufficient freedom to well repre- 
sent all classes of lightcurve. This critical a is likely to take 
a different value for different datasets (more classes would 
require more freedom and a smaller (Tcrit) but nevertheless, 
in a real-world application, one might choose to save com- 
puting effort by starting cro at a value closer to (Jcrit. 

There is evidence of some misclassification in the fi- 
nal trained network in Fig.|3 for example between skewed 
and non-skewed Gaussian profiles. This is not unexpected as 
some members of the skewed Gaussian class will exhibit a 
very small degree of skewness, and are difficult to distinguish 
from the non-skewed class members even when inspected by 
eye. Misclassification is difficult to avoid in any situation 
where there is a smooth trend in variation of light-curve 
shape between two classes, emphasising that in real-world 
applications shape-based classification would be used along- 
side other diagnostic attributes. Nevertheless as a whole the 
algorithm has performed remarkably well. 
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Figure 6. The average Euclidean distance moved by a training 
set member in map space from one iteration to the next, D. Panel 
order is the same as Figure HI 



3.3 A test with real lightcurves 

In order to test the classification abilities of the self- 
organising map using real data we chose to train it us- 
ing lightcurves from the ROTSE experiment. The training 
set consisted of 1206 periodic lightcurves w hich had been 
indep endently pre-classified by other means jAkerlof et alJ 
120001) . The externally derived classifications were not used 
in training our network, however they did allow us to as- 
sess the ability of the algorithm to group and classify class 
members after training is complete. 

FigureQshows the final state of a network trained using 
the ROTSE lightcurves. All parameters were set to the same 
values as the run in Fig.|3] in particular a was held at a value 
of 0.01 throughout. It can be seen that our algorithm has 
successfully differentiated the contact binaries, the detached 
eclipsing binaries, and the type-ab RR Lyrae systems. The 
remaining three classes in the sample, Cepheids, S Scuti and 
type-c RR Lyrae stars, possess very similar lightcurve pro- 
files (indeed the variability is driven by the same underlying 
physical process), therefore the network cannot distinguish 
between the classes on shape alone. In such cases, however, 
the use of additional information can be used to resolve the 
ambiguity. In Fig. |H|we show an example of how this addi- 
tional information can be used. This figure shows the map 
of Fig. QcoUapsed in one dimension and expanded with the 
period of the variable star. The S Scuti, Cepheid and type-c 
RR Lyrae variables, which formed an overlapping group in 
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Figure 7. The final structure of a network trained using 1206 
pre-classified ROTSE lightcurves. 

Fig. |7| have been separated in period with clearly defined 
cut-off boundaries for each type. 

Some misidentification of lightcurves is apparent in 
Fig. |7| We have investigated a number of cases and are con- 
fident that our maps have classified the data correctly. The 
discrepancies seem to be the result of occasional misclassifi- 
cation in the original ROTSE analysis. 



4 AUTOMATED EXTRACTION OF 
CLUSTERS 

Once training is complete and the map has self organised, 
it may in some applications be desirable to proceed further 
to automatically identify "clusters" within the map. Such 
a cluster extraction step would then allow the automatic 
assignment of a classification to each member of the input 
set. 

The most obvious approach to cluster extraction is a 
direct examination of the distribution of training set mem- 
bers in map space. One potential drawback of this approach 
is that the Kohonen scheme tends to spread the input mem- 
bers fairly uniformly throughout the map space (Fig. |21 see 
also Kohonen 1,990,') . This could be a particular problem in 
the case of small or noisy datasets, such as the ROTSE 
data (Fig. [7|l; in such cases the boundaries between clus- 
ters could become difficult to identify. Figure |^ shows the 
number of training set members assigned to each neuron for 
the ROTSE data set. A small degree of smoothing has been 
applied to this map to reduce noise due to small-number 
statistics in the map bins. The clusters and cluster bound- 
arie s are readily visible. 

lUltsch fc Siemoiil lll990h outline a different approach to 
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Figure 8. The ROTSE map of Fig.[7|collapsed in one dimension 
and expanded with the period of the variable star. Here the 5 
Scuti, Cepheid and type-c RR Lyrae variables, which formed an 
overlapping group in Fig.|7| have been separated in period. This 
illustrates how additional information can be used to resolve am- 
biguity in our maps, which are based on shape alone. 
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Figure 9. A map of the number of training set members assigned 
to each neuron for the network trained with the ROTSE data. The 
raw map has been filtered with a 3x3 top-hat kernel. The colour 
scale runs from black (low occupancy) to red (high occupancy). 



Figure 10. The final iteration of the ROTSE data map (shown in 
Fig.|7J when processed with the U-Matrix algorithm. The colour 
scale runs from dark blue (similar reference vectors) to red (dis- 
similar reference vectors). 



the problem by calculating what they call the U-matrix. In 
essence the U-matrix attempts to capture the rate of change 
of shape in the neuron reference vectors across the map 
space; within clusters adjacent neurons will have reference 
vectors which are rather similar to each other, whereas at 
boundaries the differences will be more marked. 

The difference between the reference vector of a neuron, 
i, and that of one of its near neighbours, j, can be calculated 
as; 



1 



In a two-dimensional map each neuron will have eight 
nearest neighbours, so we calculate L for each neighbour in 
turn and take the average. Figure 1101 shows the resulting 
map calculated from the same trained network as Figure 1^1 
The U-matrix map also readily reveals the clusters and their 
boundaries, and with noticeably less noise than the number 
density distribution map. 

Both the number density and the U-matrix maps would 
be suitable for further processing to identify contiguous ar- 
eas enclosed within the cluster boundaries. It is worth noting 
however that because the U-matrix is comparing the shape 
of the templates, rather than number counts in the training 
set, the resulting map is less susceptible to noise in cases 
where the training set is of limited size. 



5 CONCLUSIONS 

We have applied the Kohonen self-organising map scheme to 
the classification of singly periodic astronomical lightcurves. 
Our algorithm has proved capable of reliably classifying both 
synthetic and real lightcurves. The investigation has also 
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shown that the ability of the map to robustly self-organise 
is not strongly dependent on the parameters used to control 
the learning process. 

We conclude that the self-organising map will prove a 
valuable addition to the data-mining toolset for future large 
time-domain datasets. 



Kibrick, Robert I. Proceedings of the SPIE, Volume 4845, 
pp. 126-136 (2002). The RAPTOR experiment: a system 
for monitoring the optical sky in real time. p. 126 
Wozniak P. R., et al., 2004, ApJ, in press 
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